Search CORE

8 research outputs found

Integration of Riemannian Motion Policy and Whole-Body Control for Dynamic Legged Locomotion

Author: Kim Donghyun
Lvovsky Misha
Marew Daniel
Sessions Shotaro
Yu Shangqun
Publication venue
Publication date: 07/10/2022
Field of study

In this paper, we present a novel Riemannian Motion Policy (RMP)flow-based whole-body control framework for improved dynamic legged locomotion. RMPflow is a differential geometry-inspired algorithm for fusing multiple task-space policies (RMPs) into a configuration space policy in a geometrically consistent manner. RMP-based approaches are especially suited for designing simultaneous tracking and collision avoidance behaviors and have been successfully deployed on serial manipulators. However, one caveat of RMPflow is that it is designed with fully actuated systems in mind. In this work, we, for the first time, extend it to the domain of dynamic-legged systems, which have unforgiving under-actuation and limited control input. Thorough push recovery experiments are conducted in simulation to validate the overall framework. We show that expanding the valid stepping region with an RMP-based collision-avoidance swing leg controller improves balance robustness against external disturbances by up to

53\%

compared to a baseline approach using a restricted stepping region. Furthermore, a point-foot biped robot is purpose-built for experimental studies of dynamic biped locomotion. A preliminary unassisted in-place stepping experiment is conducted to show the viability of the control framework and hardware

arXiv.org e-Print Archive

A Domain-Agnostic Approach for Characterization of Lifelong Learning Systems

Despite the advancement of machine learning techniques in recent years, state-of-the-art systems lack robustness to "real world" events, where the input distributions and tasks encountered by the deployed systems will not be limited to the original training context, and systems will instead need to adapt to novel distributions and tasks while deployed. This critical gap may be addressed through the development of "Lifelong Learning" systems that are capable of 1) Continuous Learning, 2) Transfer and Adaptation, and 3) Scalability. Unfortunately, efforts to improve these capabilities are typically treated as distinct areas of research that are assessed independently, without regard to the impact of each separate capability on other aspects of the system. We instead propose a holistic approach, using a suite of metrics and an evaluation framework to assess Lifelong Learning in a principled way that is agnostic to specific domains or system techniques. Through five case studies, we show that this suite of metrics can inform the development of varied and complex Lifelong Learning systems. We highlight how the proposed suite of metrics quantifies performance trade-offs present during Lifelong Learning system development - both the widely discussed Stability-Plasticity dilemma and the newly proposed relationship between Sample Efficient and Robust Learning. Further, we make recommendations for the formulation and use of metrics to guide the continuing development of Lifelong Learning systems and assess their progress in the future.Comment: To appear in Neural Network

arXiv.org e-Print Archive

Loughborough University Institutional Repository

Q-functionals for Value-Based Continuous Control

Author: He Bowen
Konidaris George
Lobel Samuel
Rammohan Sreehari
Yu Shangqun
Publication venue: Association for the Advancement of Artificial Intelligence
Publication date: 26/06/2023
Field of study

We present Q-functionals, an alternative architecture for continuous control deep reinforcement learning. Instead of returning a single value for a state-action pair, our network transforms a state into a function that can be rapidly evaluated in parallel for many actions, allowing us to efficiently choose high-value actions through sampling. This contrasts with the typical architecture of off-policy continuous control, where a policy network is trained for the sole purpose of selecting actions from the Q-function. We represent our action-dependent Q-function as a weighted sum of basis functions (Fourier, Polynomial, etc) over the action space, where the weights are state-dependent and output by the Q-functional network. Fast sampling makes practical a variety of techniques that require Monte-Carlo integration over Q-functions, and enables action-selection strategies besides simple value-maximization. We characterize our framework, describe various implementations of Q-functionals, and demonstrate strong performance on a suite of continuous control tasks

Association for the Advancement of Artificial Intelligence: AAAI Publications

Suppression Mechanism of TiO2 for the Partial Discharge of Oil-paper Insulation in Intensive Electric Field

Author: Li Jiachen
Li Xiaolong
Liu Daosheng
Wu Yajie
Xu Xiangdong
Ye Jing
Yu Shangqun
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2019
Field of study

With the rapid development of modern HVDC transmission technology, higher insulation properties are put forward on the oil-paper insulation system of the transformer, which determine the transformer service life to a certain extent. Traditional transformer oil-paper insulation is becoming increasingly difficult to meet the demands of insulation system with large capacity and miniaturization at ultra-high voltage level. In order to improve the insulation strength of oil-paper system, the insulation cellulose paper modified by TiO2 nanoparticles of different diameters (5 nm, 10 nm, 20 nm, 30 nm) were prepared, in addition, each of modified cellulose paper has different mass fraction of TiO2 nanoparticles (1%, 3%, 5%, 7% wt.). The partial discharge (PD) detection platform was established, and the partial discharge inception voltage (PDIV) values of the oil-paper insulation system with and without nanoparticles were measured. To investigate the PD characteristics, the PD waveforms and PD frequency spectrums of modified cellulose paper and the unmodified were obtained. The suppression mechanism of TiO2 nanoparticles on PD was explored through scanning electron microscope (SEM) observation. All the experiment results indicate that adding nano-TiO2 is beneficial to enhance the insulation properties of oil-paper insulation, and the optimum diameter and mass fraction of TiO2 nanoparticles to suppress oil-paper PD were obtained

Crossref

Chalmers Research

Comprehensive Carbon Emission and Economic Analysis on Nearly Zero-Energy Buildings in Different Regions of China

Author: Haizhu Zhou
Jianlin Wu
Minchao Fan
Shangqun Xie
Shilei Lu
Xiaolong Xu
Yashuai Yang
Yiting Kang
Zhen Yu
Zheng Fu
Publication venue: 'MDPI AG'
Publication date: 09/08/2022
Field of study

Considering the comprehensive effect of building carbon emissions, cost savings is of great significance in nearly-zero-energy buildings (NZEBs). Previous research mostly focused on studying the impact of technical measures in pilot projects. The characteristics of different cities or climate zones have only been considered in a few studies, and the selection of cities is often limited. At times, only one city is considered in each climate zone. Therefore, this study selected 15 cities to better cover climate zone characteristics according to the variation in weather and solar radiation conditions. A pilot NZEB project was chosen as the research subject, in which the energy consumption was monitored and compared across different categories using simulated values by EnergyPlus software. Various NZEB technologies were considered, such as the high-performance building envelope, the fresh air heat recovery unit (FAHRU), demand-controlled ventilation (DCV), a high-efficiency HVAC and lighting system, daylighting, and photovoltaic (PV). The simulated carbon emission intensities in severe cold, cold, and hot summer and cold winter (HSCW) climate zones were 21.97 kgCO2/m2, 19.60 kgCO2/m2, and 15.40 kgCO2/m2, respectively. The combined use of various NZEB technologies resulted in incremental costs of 998.86 CNY/m2, 870.61 CNY/m2, and 656.58 CNY/m2. The results indicated that the HSCW region had the best carbon emission reduction potential and cost-effectiveness when adopting NZEB strategies. Although the incremental cost of passive strategies produced by the envelope system is higher than active strategies produced by the HVAC system and lighting system, the effect of reducing the building’s heating load is a primary and urgent concern. The findings may provide a reference for similar buildings in different climate zones worldwide

Multidisciplinary Digital Publishing Institute

Comprehensive Carbon Emission and Economic Analysis on Nearly Zero-Energy Buildings in Different Regions of China

Author: Haizhu Zhou
Jianlin Wu
Minchao Fan
Shangqun Xie
Shilei Lu
Xiaolong Xu
Yashuai Yang
Yiting Kang
Zhen Yu
Zheng Fu
Publication venue: MDPI AG
Publication date: 01/08/2022
Field of study

Directory of Open Access Journals

Synthesis of illite/iron nanoparticles and their application as an adsorbent of lead ions

Author: BD Yirsaw
BW Gu
C Jing
C Yu
Chuang Yu
G Wang
H Aghdasinia
H Jiang
HN Tran
HX Zhang
JC Shao
KC Bedin
L Yuan
LN Shi
M Fernandes
N Arancibia-Miranda
N Ezzatahmadi
R Fu
R Ravi
S Gamoudi
S Hamid
SA Kim
Saruchi
Shangqun Li
T Mwamulima
V Montoya
W Liu
W Wang
W Wang
X Liu
X Zhang
Xiaoniu Yu
Xiaoqing Cai
Xihua Yu
Y Li
YF Xi
YG Chen
YS Ho
Zexiang Wu
Ç Üzüm
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

A domain-agnostic approach for characterization of lifelong learning systems

Author: Abrar Rahman (14607977)
Alexander New (14607917)
Andrea Soltoggio (1248822)
Andrew P Brna (12320666)
Angel Yanguas-Gil (2161717)
Anurag Daram (12320672)
Aswin Raghavan (14607974)
Cassandra Kent (14607956)
Christine Piatko (14607971)
Dhireesha Kudithipudi (12320660)
Eric Eaton (14607941)
Eric Q Nguyen (14607968)
Erik Learned-Miller (69776)
Eseoghene Ben-Iwhiwhu (6115286)
Ethan Brooks (14607926)
Fabien Delattre (14607935)
Felix Wang (12320723)
Gautam K Vallabha (14608001)
George Konidaris (14607959)
Haotian Fu (14607944)
Harel Yedidsion (14607995)
Indranil Sur (14607986)
Jesse Hostetler (14607950)
Jorge A Mendez (14607965)
Kristen Grauman (14607947)
Kyle Vedder (14607992)
Mario Aguilar-Simon (12320663)
Megan M Baker (14607914)
Michael L Littman (14607962)
Neale Ratzlaff (14607983)
Nicholas Ketz (418685)
Peter Stone (62420)
Praveen K Pilly (8612634)
Ryan C Brown (14607929)
Ryan Dellana (14607938)
Saket Tiwari (14607989)
Sandeep Madireddy (12320687)
Santhosh Kumar Ramakrishnan (14607980)
Seungwon Lee (3768508)
Shangqun Yu (14607998)
Shariq Iqbal (14607953)
Soheil Kolouri (8612628)
Sébastien MR Arnold (14607923)
Zachary Daniels (14607932)
Zhipeng Tang (1937923)
Ziad Al-Halah (14607920)
Zifan Xu (13805933)
Publication venue
Publication date: 20/01/2023
Field of study

Despite the advancement of machine learning techniques in recent years, state-of-the-art systems lack robustness to “real world” events, where the input distributions and tasks encountered by the deployed systems will not be limited to the original training context, and systems will instead need to adapt to novel distributions and tasks while deployed. This critical gap may be addressed through the development of “Lifelong Learning” systems that are capable of (1) Continuous Learning, (2) Transfer and Adaptation, and (3) Scalability. Unfortunately, efforts to improve these capabilities are typically treated as distinct areas of research that are assessed independently, without regard to the impact of each separate capability on other aspects of the system. We instead propose a holistic approach, using a suite of metrics and an evaluation framework to assess Lifelong Learning in a principled way that is agnostic to specific domains or system techniques. Through five case studies, we show that this suite of metrics can inform the development of varied and complex Lifelong Learning systems. We highlight how the proposed suite of metrics quantifies performance trade-offs present during Lifelong Learning system development — both the widely discussed Stability-Plasticity dilemma and the newly proposed relationship between Sample Efficient and Robust Learning. Further, we make recommendations for the formulation and use of metrics to guide the continuing development of Lifelong Learning systems and assess their progress in the future

Loughborough University Institutional Repository